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Abstract 

Jim Propp's P-machine, also known as the 'rotor router model' is a 
simple deterministic process that simulates a random walk on a graph. 
Instead of distributing chips to randomly chosen neighbors, it serves 
the neighbors in a fixed order. 

We investigate how well this process simulates a random walk. For 
the graph being the infinite path, we show that, independent of the 
starting configuration, at each time and on each vertex, the number of 
chips on this vertex deviates from the expected number of chips in the 
random walk model by at most a constant ci, which is approximately 
2.29. For intervals of length L, this improves to a difference of 0(log L) , 
for the L2 average of a contiguous set of intervals even to 0{^/\ogL). 
All these bounds are tight. 



1 The Propp Machine 

The following deterministic process was suggested by Jim Propp as an at- 
tempt to derandomize random walks on infinite grids Z*^: 
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Rules of the Propp machine: Each vertex j; G Z"^ is associated with a 
'rotor' and a cychc permutation of the 2d cardinal directions of Z*^. Each 
vertex may hold an arbitrary number of 'chips'. In each time step, each 
vertex sends out all its chips to neighboring vertices in the following manner: 
The first chip is sent into the direction the rotor is pointing, then the rotor 
direction is updated to the next direction in the cyclic ordering. The second 
chip is sent in this direction, the rotor is updated, and so on. As a result, 
the chips are distributed highly evenly among the neighbors. 

This process has attracted considerable attention recently. It turns out 
that the Propp machine in several respects is a very good simulation of 
a random walk. Used to simulate internal diffusion limited aggregation 
(repeatedly, a single chip is inserted at the origin, performs a rotor router 
walk until it reaches an unoccupied position and occupies it), it was shown 
by Levine and Peres |LP05j that this derandomization produces results that 
are extremely close to what a random walk would have produced. See also 
Kleber's paper |Kle05j . which adds interesting experimental results: Having 
inserted three million chips, the closest unoccupied site is at distance 976.45, 
the farthest occupied site is at distance 978.06. Hence the occupied sites 
almost form a perfect circle! 

In |CS05[ir!S04j . the authors consider the following question: Start with an 
arbitrary initial position (that is, chips on vertices and rotor directions), run 
the Propp machine for some time and compare the number of chips on a 
vertex with the expected number of chips a random walk run for the same 
amount of time would have placed on that vertex. Apart from a technicality, 
which we defer to the end of Section|21 the answer is astonishing: For any grid 
Z'^, this difference (discrepancy) can be bounded by a constant, independent 
of the number of chips, the run-time, the initial rotor position and the cyclic 
permutation of the cardinal directions. 

In this paper, we continue this work. We mainly regard the one-dimensional 
case, but as will be visible from the proofs, our methods can be extended 
to higher dimensions as well. Besides making the constant precise (approx- 
imately 2.29), we show that the differences become even better for larger 
intervals (both in space and time). We also present a fairly general method 
to prove lower bounds (the 'arrow forcing theorem'). This shows that all our 
upper bounds are actually sharp, including the aforementioned constant. 

Instead of talking about the expected number of chips the random walk 
produces on a vertex, we find it more convenient to think of the following 
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'linear' machine. Here, in each time step each vertex sends out exactly the 
same (possibly non-integral) number of chips to each neighbor. Hence, for 
a given starting configuration, after t time-steps the number of chips in the 
linear model is exactly the expected number of chips in the random walk 
model. 



2 Our Results 

We obtain the following results (again, see the end of the section for a slight 
technical restriction): Fix any starting configuration, that is, the number 
of chips on each vertex and the position of the rotor on each vertex. Now 
run both the Propp machine and the linear machine for a fixed number 
of time-steps. Looking at the resulting chip configurations, we have the 
following: 

• On each vertex, the number of chips in both models deviates by at most 
a constant ci ~ 2.29. One may interpret this to mean that the Propp 
machine simulates a random walk extremely well. In some sense, it is 
even better than the random walk. Recall that in a random walk a 
vertex holding n chips only in expectation sends n/2 chips to the left 
and the right. With high probability, the actual numbers deviate from 
this by 0(ni/2)_ 

• In each interval of length L, the number of chips that arc in this 
interval in the Propp model deviates from that in the linear model by 
only O(logL) (instead of, e.g., 2.29L). 

• If we average this over all length L intervals in some larger interval of 
Z, things become even better. The average squared discrepancy in the 
length L intervals also is only O(logL). 

We may as well average over time. In the setting just fixed, denote by 
f(x,T) the sum of the numbers of chips on vertex x in the last T time steps 
in the Propp model, and by E{x, T) the corresponding number for the linear 
model. Then we have the following discrepancy bounds: 

• The discrepancy on a single vertex over a time interval of length T is 
at most \f{x,T) — E{x,T)\ = 0(r^/^). Hence a vertex cannot have 
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too few or too many chips for a long time (it may, however, alternate 
having too few and too many chips and thus have an average 
discrepancy over time). 

We may extend this to discrepancies in intervals in space and time: 
Let / be some interval in Z having length L. Then the discrepancy in 
/ over a time interval of length T satisfies 

\2^nx,-L} Z^^l^'^J - I o(Tlog(Lr-V2)) otherwise. 

Hence if L is small compared to T^/^ , we get L times the single vertex 
discrepancy in a time interval of length T (no significant cancellation 
in space); if L is of larger order than T^^"^, we get T times the 0(log L) 
bound for intervals of length L (no cancellation in time, the discrep- 
ancy cannot leave the large interval in short time). 



All bounds stated above arc sharp, that is, for each bound there is a starting 
configuration such that after suitable run-time of the machines we find the 
claimed discrepancy on a suitable vertex, in a suitable interval, etc. 

A technicality: There is one limitation, which we only briefly mentioned, 
but without which our results are not valid. Note that since Z*^ is a bipartite 
graph, the chips that start on even vertices never mix with those which start 
on odd positions. It looks as if we would play two games in one. This is 
not true, however. The even chips and the odd ones may interfere with each 
other through the rotors. Even worse, we may use the odd chips to reset the 
arrows and thus mess up the even chips. Note that the odd chips are not 
visible if we look at an even position after an even run-time. An extension 
of the arrow-forcing theorem presented below shows that we can indeed use 
the odd chips to arbitrarily reset the rotors. This is equivalent to running 
the Propp machine in an adversarial setting, where an adversary may decide 
each time where the extra odd chips on a position is sent to. It is clear that in 
this setting, the results above cannot be expected. We therefore assume that 
the starting configuration has chips only on even positions ("even starting 
configuration" ) or only on odd positions ( "odd starting configuration" ) . An 
alternative, in fact equivalent, solution would be to have two rotors on each 
vertex, one for even and one for odd time steps. 
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3 The Basic Method 



For numbers a and h set [a..h\ = {z ^ 'L\a < z < h} and [6] = [1..6]. For 
integers m and n, we write m ~ n if m and n have the same parity, that is, 
if m — n is even. 

For a fixed starting configuration, we use f{x,t) to denote the number of 
chips at time t at position x and arr(x, t) to denote the value of the arrow 
at time t and position x, i.e., +1 if it points to the right, and —1 if it points 
to the left. We have: 

f{x,t + l) = f{x~l,t)/2 + f{x + l,t)/2 

+arr(.x - l,t)(/(.T - l,t) mod 2)/2 
-arr(x + l,t)(/(x- + l,t) mod 2)/2, 
ARR(x,t + l) = (-l)^(^'*)ARR(x,t). 

Note that after an even starting configuration if x ~ t does not hold, then 
we have f{x,t) = and ARR{x,t + 1) = ARR{x,t). 

We consider the machine to be started at time t = 0. Being a deterministic 
process, the initial configuration (i.e., the values f{x,0) and arr(x, 0), x € 
Z) determines the configuration at any time t > (i.e., the values f{x,t) 
and arr(x, t), x E Z). The totality of all configurations for t > we term a 
game. We call a configuration even if no chip is at an odd position. Similarly, 
a position is odd if no chip is at an even position. Clearly, an even position 
is always followed by an odd position and vice versa. 

By E{x,t) we denote the expected number of chips on a vertex x after 
running a random walk for t steps (from the implicitly given starting con- 
figuration). As described earlier, this is equal to the number of chips on x 
after running the linear machine for t time-steps. 

In the proofs, we need the following mixed notation. Let E{x,ti,t2) be the 
expected number of chips at location x and time t2 if a simple random walk 
were performed beginning from the Propp machine's configuration at time 
ti. In other words, this is the number of chips on vertex x after ti Propp 
and t2 — ti linear steps. 

Let H{x,t) denote the probability that a chip arrives at location x at time 
i > in a simple random walk begun from the origin, i.e., H(x,t) = 
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2~* ((t+x)/2) , if i ~ a;, and H{x, t) = otherwise. For t > let lNF(y, t) 
denote the "influence" of a Propp step of a single chip at distance y with 
t linear steps remaining (compared to a linear step). More precisely, we 
compare the two probabilities that a chip on position y reaches if (a) it is 
first sent to the right (by a single Propp step) and then does a random walk 
for the remaining t — 1 time steps, or (b) it just does t random walk steps 
starting from y. Hence, 

INF(2/,t) :=H{y + l,t-l)-H{y,t). 

A simple calculation yields 

INF(y,t)=-|i?(y,t). (1) 

This shows in particular, that lNF(y, t) < for y > and inf(7/, t) > Q for 
y < 0. We have inf(0, t) = 0. 

For notational convenience we extend the definitions of H{x,t) and INF(a;,i) 
by letting H{x, t) = for t < and iNF(a;, i) = for i < 0. 

Note that 

INF(y, t) = \H{y + 1, t - 1) - \H{y -l,t-l). (2) 

Therefore, the first Propp step with arrow pointing to the left has an influ- 
ence of — lNF(y, t). 

Using this notation, we can conveniently express the (signed) discrepancy 
f{x,t) — E{x,t) on a vertex x using information about when "odd splits" 
occurred. It suffices to prove the result for the vertex x = 0. Clearly, 
E{<d, t, t) = /(O, t) and E{0, 0, t) = E{0, t), so that 

i-l 

/(O, t) - E{0, t)=Y, iE{0, s + l,t)- E{0, s, t)) . (3) 

In comparing £'(0, s + l,t) and E{0, s, t), note that whenever there are two 
chips on some vertex at time s, then these chips can be assumed to behave 
identically no matter whether the next step is a linear or a Propp step. 
Denote by odDs the set of locations which are occupied by an odd number 
of chips at time s. Then 

E{Q,s + l,t) - E{Q,s,t) 

= iH{y + ARR{y,s),t-s-l)-H{y,t-s)) 

yeoDDs 

= J2 ARR(y,s)lNF(y,i-s). 

2/eODDs 
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Therefore, appealing to 



t-i 

f{Q,t)-E{Q,t)=Y, ARR(y,s)iNF(y,t-s). 

Using INF(y,ti) = for n < we can extend the summation above for all 
non- negative integers s. 

Let Si{y) be the z*^ time that y is occupied by an odd number of chips, 
beginning with i = 0. Switching the order of summation and noting that 
the arrows flip each time there is an odd number of chips on a vertex, we 
have 

/(O, t) - E{0, t) = Y,J2 ^^^^2/' ^^(2/)) iNF(y, t - Si{y)) 

= 0) ^^(-1)' iNF(y, t - s,{y)). (4) 

yeZ i>0 

This equation will be crucial in the remainder of the paper. It shows that 
the discrepancy on a vertex only depends on the initial arrow positions and 
the set of location-time pairs holding an odd number of chips. 

In the remainder, we show that we can construct starting configurations 
with arbitrary initial arrow positions and odd number of chips at arbitrary 
sets of location-time pairs. This will be the heart of our lower bound proofs 
in the following sections. Here No denotes the set of non-negative integers. 

Theorem 1 (Parity- forcing Theorem). For any initial position of the 
arrows and any vr : Z x Nq ^ {0, 1}, there is an initial even configuration of 
the chips such that for all x G Z, t £ Nq such that x ^ t, f{x, t) and 7r(x, t) 
have identical parity. 

Since rotors change their direction if and only if the vertex has an odd 
number of chips, the parity-forcing theorem is a consequence of the following 
arrow- forcing statement. 

Theorem 2 (Arrow-forcing Theorem). Let p{x,t) G { — 1,-1-1} be arbi- 
trarily defined for t > integer and x ^ t. Then there exists an even initial 
configuration that results in a game with arr(x, t) = p{x, t) for all such x 
and t. Similarly, if p{x, t) is defined for x ^ t + 1 a suitable odd initial 
configuration can be found. 
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Proof. By symmetry, it is enough to prove the first statement. 

Assume the functions / and arr describe the game following an even ini- 
tial configuration, and for some T > 0, we have ARR{x,t) = p{x,t) for 
all < f < T + 1 and a; ~ t. We modify the initial position by defining 
/'(x,0) = f{x,0) + £2:2"^ for even x, while we have f'{x,0) = for odd x 
and ARR'(a;, 0) = ARr(x, 0) for all x. Here, ex G {0, 1} are to be determined. 

Observe that a pile of 2^ chips will split evenly T times so that the arrows 
at time t < T remain the same. Our goal is to choose the values so 
that ARR'(x,t) = p{x,t) for < t < r + 2 and x ^ t. As stated above 
this holds automatically for t < T as ARR'{x,t) = ARR{x,t) = p{x,t) in 
this case. For t = T + 1 and x — T — 1 even we have AR,r'(.x,T + 1) = 
ARR'(x,r) = ARR(a;,r) = ARR(x,r + 1) = p{x,T + 1) since we start with 
an even configuration. To make sure the equality also holds for t = T + 2 we 
need to ensure that the parities of the piles f'{x, T) are right. Observe that 
ARr'(x,T + 2) = ARr'(x,T) if f'ix,T) is even, otherwise ARr'(.x,T + 2) = 
— ARR'(a;,T). So for x — T even we must make f'{x,T) even if and only if 
p{x,T + 2) = p{x, T). At time T the "extra" groups of 2^ chips have spread 
as in Pascal's Triangle and we have 



where x ~ T and the sum is over the even values of y with ly — x| < T. As 
f{x,T) are already given it suffices to set the parity of the sum arbitrarily. 
For T = the sum is Cx so this is possible. For T > we express 



where h depends only on ey with x — T<y<x + T. We now determine 
the €y sequentially. We initialize by setting = for — T < y < T. The 
values Cy for y > T are set in increasing order. The value of Cy is set so that 
the sum at x = y — T (and thus f'{y — T,T)) will have the correct parity. 
Similarly, the values for y < —T are set in decreasing order. The value of 
€y is set so that the sum at x = y + T (and thus the f'{y + T,T)) will have 
the correct parity. 

Note that the above procedure changes an even initial configuration that 
matches the prescription in p for times < t < T + 1 into another even 
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initial configuration that matches the prescription in p for times < t < 
T + 2. We start by defining f{x,0) = for all x (no chips anywhere) and 
arr(3;, 0) = p{x, 0) for even x, while arr(x, 0) = p{x, 1) for odd x. We now 
have arr(x, t) = p{x, t) for < t < 1 and x ~ t. We can apply the above 
procedure repeatedly to get an even initial configuration that satisfies the 
prescription in p for an ever increasing (but always finite) time period < 
t < T. Notice however, that in the procedure we do not change the initial 
configuration of arrows arr(x, 0) at all, and we change the initial number 
of chips f{x,0) at position x only if |x| > T. Thus at any given position 
X the initial number of chips will be constant after the first |x| iterations. 
This means that the process converges to an (even) initial configuration. It 
is simple to check that this limit configuration satisfies the statement of the 
theorem. □ 



4 Discrepancy on a Single Vertex 

Theorem 3. There exists a constant ci « 2.29, independent of the initial 
(even) configuration, the time t, or the location x, so that 



The proof needs the following elementary fact. Let X C M. We call a 
mapping / : X — > M unimodal, if there is an m E X such that / is mono- 
tonically increasing in {x ^ X \ x < m} and / is monotonically decreasing 
in {x £ X \ x > m}. 

Lemma 4. Let f : X be non-negative and unimodal. Let ti, . . . ,tn G X 
such that ti < ... < tn. Then 



Proof of Theorem\^ It suffices to prove the result for x = 0. In case t is even 
we start with an even configuration, if t is odd, then with an odd configura- 
tion (otherwise both f{0,t) and E{0,t) would be zero with no discrepancy). 

First we show that iNF(y, u) with a fixed y < is a non-negative unimodal 
function of u if restricted to the values u ^ y. We have already seen that it 



\f{x,t)-E{x,t)\ <ci. 
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is non- negative. For the unimodality let y < and n > 2, n ~ y. We have 

lNF(y, u) - lNF(y, u + 2) = --H{y, u) + -^H{y, u + 2) 

u u + 2 

4 + - 2/2 
{u + 2-y){u + 2 + y) 

whenever u > y. Hence the difference is non-neg ative if li > (y^ _ 4)/3 
and it is non-positive if n < (y^ — 4)/3. Thus we have unimodahty, with 
iNF(y, u) taking its maximum at the smallest value of u exceeding (y^ — 4) /3 
with u ^ y. Let tmax (y) := L(?/^ ~ 4)/3j-|-2. It is easy to check that 

^max (y) 

~ y always holds, so we have that iNF(y, u) takes its maximum for 
fixed y < at n = tmax(?/)- For y > the values lNF(y, u) are non-positive 
and by symmetry the minimum is taken at u = tmax iv)- For y = we have 
iNF(y, n) = for all u. We have just proved the following: 

Lemma 5. For y G the function |iNF(y, t)\ is maximized over all integers 

t at Wx(y) = L(y^-4)/3j +2. 

To bound |/(0, t) — E(0, t)\ we use the formula (jU where the inner sums are 
alternating sums, for which we can apply Lemma 0J as y ~ t — Si(y) holds 
by our even or odd starting position assumption. We get 



< 



\f{0,t)-E{0,t)\<Y, ^(-iriNF(y,t-5,(y)) 

yeZ i>0 

Emax |lNF(y, u)\ 
u 



oo 



Here 

|lNF(y,tmax(y))| 



2^|lNF(y,tmax(y))|- (5) 
y=l 



imax(y) V(Wx(y)+y)/2 



= 0(y/(W(y))'/') = 0(y-'). 
and, therefore, © implies that |/(0, t) — E{0,t)\ is bounded by 

oo 

ci :=2j;|lNF(y,C(y))| «2.29, 

y=l 

proving Theorem |31 □ 
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Amazingly, the constant ci defined above is best possible. Indeed, let y > 
be arbitrary and even and let to = tmax(y)- We apply the Arrow- forcing 
Theorem to find an even starting position that makes arr(x, t) = — 1 if x > 
and t < to — traax (x) OT X < and t > to — tmax (x) and makes arr(x, t) = —1 
otherwise. It is easy to verify that in this case at a position |x| < y, x ^ 
we have an odd number of chips exactly once at time to ~ ^max (x) and the 
formula Q gives 

y 

/(O, to) - E{0, to) = 2 ^ |inf(x, t„,ax {x))\. 

x=l 

5 Intervals in Space 

In this section, we regard the discrepancy in intervals in Z. For an arbitrary 
finite subset X of Z set 

f{X,t) := J^/(x,t), 

E{X,t) := Y,E{x,t). 

xex 

We show that the discrepancy in an interval of length L is O(logL), and 
this is sharp. We need the following facts about H. 

Lemma 6. For all x £ Z, H{x,-) : {t G No | x ~ t} ^ R; t i-^ H{x,t) is 
unimodal. H{x,t) is maximal for t = x'^ . We have H{x,x'^) = 0(|x|~^). 

Proof. Since H{x,t - 2) - H{x,t) = ^^^H{x,t), we conclude that H{x,t) 
is unimodal and for |x| > 2 it has exactly two maxima, namely t = x'^ — 2 
and t = x^ , while for |x| < 1 the latter is the only maximum. A standard 
estimate gives the claimed order of magnitude. □ 

Theorem 7. For any even initial configuration, any time t and any interval 
X of length L, 

\f{X,t)-E{X,t)\=0{logL). 

For every L > there is an even initial configuration, a time t and an 
interval X of length L such that 

\f{X,t)-E{X,t)\=n{logL). 
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Proof. Using that the discrepancy of a single position is bounded we can 
assume X ends at an even position, and then by symmetry we may assume 
it ends at 0, i.e., X = [—L + 1..0]. Fix any even initial configuration. By 
(jlj, we have 

f{X, t) - E{X, t)=Y, 0) ^(-l)*iNF(y -x,t- s,{y)). 

Note that the summation here can be restricted to values x ~ t, the other 
values contribute zero. 

Let us call 

CON{y) := ARR(y, 0) ^ ^(-1)*inf(2/ -x,t- Si{y)) 

x£X i>0 

the contribution of the vertex y to the discrepancy in the interval X. The 
contribution of a vertex depends on its distance from the interval X. If y is 
0(L) away from X, its infiuences on the various vertices of X are roughly 
equal, and all such influences are quite small. In this case we bound its 
influence by L times the one we computed in Theorem |2l 

Let y > L. By Lemmas and El 



|C0N(y)| 



Yl '^(-'^y^^^iy -x,t- Siiy)) 

< Yl J](-l)*iNF(2/-x,t-Si(y)) 

x£X i>0 

< ^ max |lNF(y — x,t)\ 

x£X 

<0(Y.{y-xrA=0{Ly-^). 

^x&X ' 



Hence the total contribution of these vertices is at most 
^|C0N(y)| = 0(^Ly-2] =0(1) 



and by symmetry the same bound applies to the contribution of vertices 
y < -2L. 
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We now turn to vertices —2L<y<L. Here mainly those vertices of X 
that are close to y contribute to CON(y). Hence, the approach above is too 
coarse. We use instead that yields a collapsing sum. To simplify our 
formulas we introduce 



Note that H'{x,t) = H{x,t) for x ^ t and H'{x,t) = H{x — l,t) otherwise. 
Also note that H'{x,t) is not unimodal in t, but fixing x and restricting t 
to only even or only odd values it becomes unimodal. As Si{y) ~ y we can 
still apply Lemma |1] below. 

Using ((2j and Lemmas 0] and El we have 



H'{x, t) = H{x -l,t) + H{x, t). 



CON(y)| 



i>0 x6X 



iE(-i)^E [^(y-^ + l'*- 



s^(y) - 1) 



i>0 xGX 



H{y -x-l,t- Si{y) 



iy[H\y + L,t-s,{y) 



1) 



H'{y,t-s^{y)-l)] 



< lY^{-iyH'{y + L,t-s,{y)-l) 



i>0 



< i maxH'iy + L, s) + ^ maxi^'f-u, s) 




= 0(l/(y + L-l/2))+0(l/(y-l/2)). 



Thus the vertices in [— 2L + 1..L] contribute at most 



2L 



|C0N(y)| = l/(z - 1/2)) = O(logL). 



2yG[-2L+l..L] i=\ 
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Combining all cases, we have 

\f{X,t)-E{X,t)\ < ^|C0N(y)| = O(logL). 

For the lower bound, we just have to place the chips in a way the logarithmic 
contribution actually occurs. Without loss of generality, let L be odd. 

Consider the following initial configuration (its existence is ensured by the 
parity forcing theorem): All arrows point towards the interval X (arrows 
of vertices in X may point anywhere). Let t = L^. Choose an initial 
configuration of the chips such that f{y, s) is odd if and only if y G [L] is 
even and t — s = y^. 

Now by construction, COn(7/) = for all y G Z \ [L]. For ye [L], we have 

COn(j/) = - ^ iNF(y - X, y^) 

xex 

= \H{yy)-\H{y + L+l,y'') 

> lH{y, y2) - ^-H{y + L+l,{y + L + if) 

= ^iy-')- 

Hence for this initial configuration, 

f{x,t)-E{x,t) = J2co^{y)= E o{y-') = n{logL). 

□ 

6 Intervals in Time 

In this section, we regard the discrepancy in time-intervals. For x G Z and 
finite -S C No, set 

tes 

E{x,S) := E^(^'^)- 
tes 

We show that the discrepancy of a single vertex in a time-interval of length 
T is 0{VT), and this is sharp. 
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Theorem 8. The maximal discrepancy \f{x,S) — E{x,S)\ of a single ver- 
tex X in a time interval S of length T is 0(r^/^). 

In the proof, we need the following fact that "rolling sums" of unimodal 
functions are unimodal again. 

Lemma 9 (Unimodality of rolling sums). Let f : Z ^ R be unimodal. 
Let fc G N. Define F : Z — > M 6?/ F{z) = Yli=o Then F is unimodal. 

Proof. Let / and m G Z be such that / is non-decreasing in Z<ot and non- 
increasing in Z>m. We show that for some m, — k < AI < m we have that 
G{x) := F{x + 1) — F{x) is nonnegative for x < M and nonpositive for 
X > M. This implies that F is unimodal. 

Since G{x) = f{x + k) — f{x) for all a; G Z, G{x) is non-negative for x < 
m — k and it is nonpositive for x > m. For m — k < x < m we have 
G{x + 1) - G{x) = {fix + k + l) - f{x + k)) - {fix + 1) - fix)) < 0, that 
is, G is non-increasing in [m — k..m]. Hence M exists as claimed. □ 

Of course, analogous statements hold for functions defined only on even or 
odd integers. 

The following result says that a single odd split has an influence of exactly 
one on another vertex over infinite time. 

Lemma 10. For allx eZ \ {0}, ^^^^ |lNF(x,t)| = 1. 

Proof W.l.o.g., let x G N. Then |lNF(x,t)| = ^H{x - l,t - I) - ^H{x + 
l,t — 1). Consider a random walk of a single chip started at zero. Let Xy^t 
be the indicator random variable for the event that the chip is on vertex y 
at time t. Let Yy^t be the indicator random variable for the event that the 
chip is on vertex y at time t and that it has not visited vertex x so far. Let 
T denote the first time the chip arrives at x. 

For any t > s > we have by symmetry that Pr{Xx-i^t-i = MT = s) = 
Pr(Xa;_|_i^i_i = l|r = s). Clearly, for t < T, Xx+i^t-i = 0, and for t > T, 



15 



yx-^i,t-i = 0. Thus 



ten 

\ ^ Pr(r = s)Y^ - I T 



ten ten 
_ 1 

2 



sgN teN 

2 



sen te[s\ 



\ Pr(T = s)e(Y1 1^ = 

sgn ^ ten ^ 



■ teN 

Note that E{^^f^^Yx-\^t-\) is just the expected number of visits to x — 1 
before visiting x. This number of visits is exactly k if and only if the chip 
moves left after each of its first k — 1 visits and right after the A;th visit. 
This happens with probability 2~'^. Hence E(J2f.^-^ Yx-i^t-i) = J2ien ~ 
2. ' □ 



Proof of Theorem\^ Fix any even initial configuration. Let to £ I^o and 
S =[to..to + T-l]. Without loss, let x = 0. By 0, we have 

f{0,S)-E{0,S) = j;(/(0,t)-i?(0,t)) 

tes 

yez i>0 teS 

By unimodality of rolling sums (Lemma I^J , 



\fiO,S)-E{0,S)\ < max ^iNF(y,t-s) 

yez * teS 

We estimate the term max^gN I S4g5iNF(y,t — s)| for all y. For 1 < |y| < 
T^/^, we use Lemma ITUl and simply estimate 



max 



^lNF(y,t-s) < ^|lNF(y,t)| = 1. 

tes ten 



(6) 
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For \y\ > T^/^ 



max 



^lNF(y,t-s^ 



by Lemma El Hence 



< rmc^|iNF(y,t)| = TO{y~ 



1/(0, 5) - E{0, 5)1 < Yl 1 + ^ E 0{y-') = 0{T^I^). 

l<|y|<Tl/2 |y|>Tl/2 

For the lower bound, we invoke the parity forcing theorem again. By this, 
there is an even initial configuration such that all arrows point towards 
zero, and such that there is an odd number of chips on vertex x € Z at time 
t G No if and only if x G X := [VT .. 2\/r] and t = AT-x^. For this initial 
configuration and S = [AT + 1 . . 5T] , we compute 

|/(0,5)-i?(0,5)| 

t&Sy&X 

> (l/2)r3/2min{|lNF(y,t)| \y £ X,t G S,y ^ t} 



□ 



7 Space-Time-Intervals 



We now regard the discrepancy in space-time- intervals. Extending the pre- 
vious notation, for finite X C Z and finite 5 C No set 

f{X,S) := EE/(^'*)' 

E{X,S) := 

Theorem 11. Let X C Z and S C Nq &e finite intervals of lengths L and T, 
respectively. Then the maximal discrepancy \f{X,S) — E{X,S)\ (taken over 
all odd or even initial configurations) is 0(Tlog(LT^^/^)), if L > 2T^/^, 
and 0(LT^/^) otherwise. 
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Proof. For the upper bound we use Theorems [7| and |H1 To prove \f{X, S) — 
E(X,S)\ = 0{LT'^/'^) we can simply apply Theorem|Sl 

\f{X,S)-E{X,S)\ < ^\f{x,S)-E{x,S)\ 
< LOiT^'"^). 

For the other upper bound \f{X,S)-E{X,S)\ = 0{T\og{LT-^/'^)) we have 
to separate contributions of the vertices and apply the bounds in the proof 
of Theorem [71 for most of them and the bounds from the proof of Theorem |S1 
for the rest. 

Fix an even initial configuration. Without loss of generality, let X = [—L + 
1..0]. Let Iq G No and S = [t^ ..t^ + T — 1\. As in previous proofs, by @ we 
have /(X, S) - E{X, S) = E^ez CON(y) with 

CON(y) := ARR(y, 0) J^(-l)* Yl Yl ^''^^ -x,t- Si{y)). 
i>o xgx tes 

Here CON(y) is the sum for t £ S of the contribution CONt(y) of y to the dis- 
crepancy of the interval X at a single time step t. The bound we established 
in the proof of Theorem[7|is |cONt(y)| = 0{Ly~'^) for y > L and y < —2L 
and |C0Nt(y)| = 0{l/{y - 1/2) + l/(y + L - 1/2)) for -2L < y < L. Thus 
we have 

|C0N(y)| = 0{LTy-^) 
for y > L and y < —2L and 

|C0N(y)| = 0{T/{y - 1/2) + T/{y + L + 1/2)) 

for -2L <y <L. 

The above bounds are the largest for y close to or —L. For \y\ < T^/^ 
and for |y + L| < T^/^ we bound |cON(y)| in a different way. Let X' be the 
interval [-L+2\T'^/'^]..-2\T'^/'^]] or empty if -L+2[ri/2] > -2\T'^/'^]. We 
express the contribution CON(y) of y as the sum of contributions to different 
parts of X. Let CON'(y) be the total contribution of the vertex y to the 
discrepancy in X' over the time interval S. Since y is separated from X' by 
at least T^/^ the above bound gives CON'{y) = 0(T"^/^). Let CON^(y) be 
the total contribution of y to the discrepancy of the single vertex x £ X\X' 
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over the time interval S. To bound CON^(y) we apply the technique of the 
proof of Theorem |HJ by Lemma we have |cON^.(y)| < 1. Thus we have 

|C0N(y)| < |co<.(y)| + |C0<(y)| = 0(Ti/2) + 0(TV2) = 0(TV2). 

xex\x' 

Let Hi be the set of vertices y with y < —2L or y > L. The total contribu- 
tion of these vertices is at most 

5; |C0N(y)|= 0{LTy-^)=0{T). 

Let H2 be the set of vertices y with \y\ < T^/"^ or |y + L| < T^^"^. The total 
contribution of these vertices is at most 

5^ |coN(y)|= ^ o(rV2) = o(r). 

Let be the the set of vertices y outside Hi and i?2- Their total contibution 
is bounded by 

|C0N(y)| = Y 0{T/y + T/{y + L)) 

= o(t J] iA| =o(riog(L/ri/2)). 

Finally we have 

= 1 J]cON(y)| 

< 0{T) + 0(T) + 0{Tlog{L/T^/^)) 

= o(riog(L/ri/2)). 

We now prove the corresponding lower bounds. Assume first that L > 2T^/^. 
Set Y = [TV2..L]. Choose an even initial configuration such that f{x,t) 
is odd if and only if x S y and t = — x^. Direct all arrows towards 
zero. Let S = [L^.. + T — 1]. Then for y £ Y, with appropriately chosen 
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6t,et £ {0, 1} we have 

L^+T-l 

C0n(2/) = Yj Yj |lNF(2/-X,i- 

y^+T-l 

> \ iH{y-l + et,t-l)-H{y + L-l-6t,t-l)) 

> ^[ H{y-l + eut-l)\ 

For y , CON(y) = 0. Hence the discrepancy in this setting is 

L 

^CON(y)= Y ^{Ty'') = ^{T\og{LT~^l^)). 



Assume now that L < 2T^/^. The setting of Theorem |H1 works for this lower 
bound, too. Choose an initial configuration such that f{x,t) is odd if and 
only if X E X := [TVa.. 2ri/2] and t = AT-y^. Then 

5T-1 

CON(y) = YY1 - X, t - (4r - 

xeX t=4T 

> Lrmin||lNF(y,t)| y G Zn [T^^^ ..3T^^^],t e Z n [T..5T],y t| 
= 0(L) 

for all y G y. Again, CON(y) = for y ^ T. Hence Ylyi^z CON(y) = 
0(LrV2). □ 

8 Intervals in Space, Revisited 

We stated in Theorem [7| that the discrepancy in an interval of length L is 
O(logL). Here we show that intervals of length L with about logL dis- 
crepancy are very rare, the root-mean-squared (i.e., quadratic) average of 
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the discrepancies of a long contiguous set of intervals of length L is only 
0(\/log L), and this bound is tight. 

For a set X of vertices we denote by Disc(X, t) the discrepancy of the set 
X at time t, i.e., we set Disc(X,t) = f{X,t) - E{X,t). 

Theorem 12. Let X be an interval of length L. For M sufficiently large, 

M 

— ^Disc2(X + A:,t) = 0(logL). 
fe=i 

Furthermore, for a given L and M there exists an even initial configuration, 
and a time t and an interval X of length L such that 

1 *^ 

— ^Bisc\x + k,t) = n{\ogL). 

k=l 

Proof. For the first statement we need to prove an 0{\/log L) bound on the 
quadratic average of the discrepancies Disc(X + k,t) with k = 1, . . . ,M. 
First note that by changing the individual discrepancies by a bounded 
amount, we change the quadratic average by at most the same amount. 
We use this observation to freely neglect 0(1) terms in the discrepancy 
of the intervals. In particular we can change the intervals themselves by 
adding or deleting a bounded number of vertices. We use this to make a few 
simplifying assumptions. As in Section [7| we assume that (i) the starting 
configuration is odd, (ii) the interval X \s X = \—L'..L'] with V ~ t, and 
(iii) M is even and we only consider even values of k, i.e., we consider the 
average of Disc2(X + A;,t) for 2 < < M, /c even (this can be justified by 
considering X + k + 1 instead oi X + k for odd k). 

First we show that discrepancies caused by odd piles at time t — L? or 
before can be neglected. We start with ((IJ) for the individual discrepancies 
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DISC(x,t). 

DlSC(x,t) = ^ARR(y,0)^(-l)*lNF(y-3;,i-Si(y)) 

= DlSCi(x,i) + DlSC2(a;,i); 

DisCi(x,t) = ^ARR(y,0) {-iymF{y-x,t-Si{y)y, 
y& s,(s/)>t-L2 



|DISC2(x,t)| 



= ^ARR(y,0) Yl (-l)'iNF(y-x,t- Si(y)) 

y& s,{y)<t-L^ 
< y max |lNF(y — X, ii)|. 



We have seen that |inf(z, u)| is unimodal for fixed z and its maximum is at 
u = so we have 

L 

|DisC2(x,t)| < 2^|inf(z,l2)| +2^ |inf(z, [zV3l)| 

2=1 Z>L 

z=l z>L 
L 

< 2Yh{z,L^)/L + 0{1/L)=0{1/L). 

z=l 

Therefore the total contribution of DISC2 to the discrepancy of an interval 
X + k \s small. For 

msCi{X + k,t) := ^ DisCi(x,t) 

x£X+k 

we have 



|disCi(X + k,t) - Disc(X + k,t)\ 



Y DISC2(x,t) 



0(1). 



We continue as in Section collapsing a sum using inf(z, u) = ^-ff (-z + 1, n — 
1) - \H{z-1,u-1). We also use that Disc(x,t) = DisCi(x,t) = for x ~ t 
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as the starting configuration is odd. 

DISCi(X + k,t) 

J2 5^ARR(y,0) Yl {-lpMy-x,t-Si{y)) 

xeX+k,xr^t+lyeZ Si{y)>t-L^ 
yeZ Si{y)>t-L'^ X ^ 

~H{y-x-l,t-Si{y)-l)^ 
= ^^ARR(2/,0) i-iy{Hiy-k + L',t-Si{y)-l) 

-H{y-k-L',t-Si{y)-l)). 
We separate the two terms in this last expression. With 

L>(m) :=2^ARR(y,0) Y {-'^yH{y-m,t-Si{y)-l) 

yez siiy)>t-L2 

we have 

DlsCi(X + k,t) = ^D{k - L') - ^^D{k + l!). 

Our original goal was to prove an 0(-v/log L) bound on the quadratic average 
of DISC(X + As DlSCi(X + k,t) differs from DISC(X + k,t) by 0(1) 

it is clearly enough to prove the same bound for the quadratic average of 
DlsCi(X + By the last displayed formula it is enough to prove the 

0(-v/IogX) bound on the two parts D{k — L') and D{k + L') separately, both 
for < k < M even. It is therefore enough to bound the quadratic average 
of D{m) for an arbitrary interval / of length M. Here we consider only 
values m ^ t, for other values of m we have D{m) = 0. 

Let to = max(0, i — + 1) be the first time-step considered. For y G Z and 
It ~ y + 1 we have an odd pile at y if and only if ARR(y, u) 7^ ARR(y, u + 2) 
and in this case ARR(y,u) = (— 1)*ARr(?/, 0) for the index i with Si{y) = u. 
We estimate the contribution D(m, y) of a fixed value y to the sum defining 
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D{m). For m ^ t we have 

D{m,y) ■.= 2ARK{y,0) ^ {-iyH{y-m,t-Si{y)-l) 

Si{y)>t-L2 

= ^ {ARR{y,u) - ARR{y,u + 2))H{y -m,t-u-l) 

to<u<t,u~2/+l 

= ^2 ARR{y,u){H{y — m,t — u— 1) — H{y — m,t — u + 1)) 

to+2<M<t,n~j/+l 

+ARR(y, ti)H{y - m,t - h - 1) - ARR(y, t2)H{y -m,t-t2 + 1), 

where ti = ti{y) is either to or tQ + 1, whichever makes ti ~ y + 1 and 
similarly t2 = t2{y) is either t or t + 1, so that t2 ~ y + 1. We have 

D{m) = ^D{m,y) 

and with 

D'{m):='^ ^ ARR(y,'u)(ii'(y-m,t-'u-l)-i7(y-m,t-'U + l)) 

y& to+2<u<t-2 



we have 



\D{m) — D'(m) 



^(ARR(y, ti{y))H{y - m,t - ti{y) - 1) 



- ARR(y, t2{y))H{y -m,t- t2{y) + 1)) 
< Yl YH{y-m,u)<4. 

u£{0,l,t-to-2,t-to-l} 



As before, we ignore the small difference and will prove the 0{^/logL) bound 
on the quadratic average of D'{m) instead of D{m). Computing the square 
and summing over m we get the following. The summations are taken for 
m & I, m ^ t, for y^y2 G and for ui,U2 G [to + 2,t — 3], ui ~ W2 ~ y + 1, 
respectively. 

^^'^(m) = ^ ^ ARR(yi,tii)ARR(?/2,tt2) 
m yi,y2Ui,U2 

■ Y{H{yi - m,t- ui- 1) - H{yi -m,t-ui + 1)) 

m 

■{H{y2 - m, i - U2 - 1) - H{y2 -m,t-U2 + 1)). 
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Let us estimate the contribution to this sum coming from a fixed yi, ui, 

and U2- Disregarding the signs and extending the summation for all m 
(even outside /) the contribution of each of the four terms we get from the 
multiplication is exactly 1. As ui and U2 can take at most values each, 
the total contribution coming from a single value of yi is at most L^. 

Let us obtain the intervals /' and I" from / by extending or shortening it 
at both ends by respectively, i.e., if / = [a, b], then I' = [a — L'^,b + L^], 
/" = [a + L^, b — L^]. If 2/1 is outside /' we have H{yi — m,t — ui — 1) = 
H(yi — m, t — ui + 1) = for all m £ I, therefore such yi has zero contribution 
to X^^-D'^(m). The contribution for fixed yi, y2, ui, and U2 can usually be 
written in closed form using the identity 

H{yi - m, vi)H{y2 - m, V2) = H{yi - 7/2, + V2). 

m 

This identity is valid if we sum over all possible values of m, but for yi G /" 
the contribution of the values m ^ I is zero. Therefore the contribution to 
D'^{m) of the fixed terms yi G 7/2, ui, and U2 is 

ARR(yi, Mi)ARR(y2, U2) '^{H{yi - m,t - ui - 1) - H{yi ~ m,t - ui + 1)) 

m 

■ {H{y2 -m,t-U2-l) - H{y2 -m,t-U2 + 1)) 
= ARR(yi, ni)ARR(y2, U2){H{y, v-2)- 2H{y, v) + H{y, v + 2)), 

where y = yi — y2 and v = 2t — ui — U2- 

To estimate these contributions we first calculate 

H{y, v-2)- 2H{y, v) + H{y, v + 2)= 0{y^/v^ + l/v^)H{y, v + 2). 

The same y = yi — 2/2 value arises exactly once for every yi G /", a total of 
M — 2L^ possibilities. The largest possible value of v is less than 2L^ and 
any single value v can be the result of at most v pairs ui, U2- There are 
4L^ possible values of j/i outside I" but inside /' contributing at most 4L^. 
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Summing for all these contributions we estimate 



m 



AL" + 



AL'' + 




v=i j/ez 



MvY^ivVv' + l/v')Hiy, V + 2) 



v=l yeZ 



V 



0{L^ + M log L). 



Here we used the estimate on the fourth moment of the random walk: 



To finish the proof we set the threshold M > for sufficiently large M. 
We did not make an effort to optimize for this threshold. This ensures that 
Ylm = O(MlogL), so the quadratic average of D'{m) (and therefore 

of Disc(X + k,t)) is 0{^/logL) as claimed. 

It remains to construct a starting configuration where the quadratic average 
of discrepancies in the intervals of length L is large. For our construction 
we do not even use the value L. For a given (even) parameter t, we define 
a probability distribution on starting positions, such that for all L < t and 
all intervals X of length L the expectation of DISC^(X, t) = Q{logL). 

We let r{a,b) stand for independent random ±1 variables for all integers a 
and b > 1. We look for an even starting configuration (guaranteed by the 
Arrow-Forcing Theorem), such that arr(x,u) = r{a,b) for all even x and 
u satisfying 4* < u < 4*+-^ and a2* < x < {a + 1)2*. For simplicity we set 
arr(2;, u) = 1 for all u and all odd x and we also set arr(x, u) = 1 for all 
X and n < 4. 

A simple calculation similar to the one in Section[7|shows that for an interval 
X = [c, d] we have 



y'H{y, v + 2) = Oiiv + 2f) = 0{v^). 
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where the coefficients h{a, b) depend on X. Further analysis shows that ah 
coefficients are bounded and 0(logL) of them are above a positive abso- 
lute constant for each interval of length L. This implies that the expecta- 
tion of DISC^(X, t) is r2(logL), and therefore the expectation of the average 
W Sfcli DiSC^(X -I- k,t) is also J7(logL). This proves the second statement 
of the theorem. □ 
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